Reviews: Mo' States Mo' Problems: Emergency Stop Mechanisms from Observation
The paper proposes a method for improving the convergence rate of RL algorithms when one has access to a set of state-only expert demonstrations. The method modifies the given MDP so that the episode terminates whenever the agent leaves the set of states that had high probability under the expert demonstrations. The paper then proves an upper bound on the regret incurred by their algorithm (relative to the expert) in terms of the regret of the RL algorithm used to solve the modified MDP. A set of experiments shows that the proposed mechanism can effectively trade off convergence rate against optimality. The exposition is clear, and the paper is easy to follow.
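The termination mechanism the review describes can be sketched as an environment wrapper. This is a minimal illustration, not the authors' implementation: the Gym-style `step()`/`reset()` interface, the toy `ChainEnv`, and all names here are assumptions for the sake of the example.

```python
class ChainEnv:
    """Toy 5-state chain MDP: actions -1/+1 move the agent along states 0..4.
    Reaching state 4 yields reward 1 and ends the episode."""
    def reset(self):
        self.state = 2
        return self.state

    def step(self, action):
        self.state = max(0, min(4, self.state + action))
        reward = 1.0 if self.state == 4 else 0.0
        done = self.state == 4
        return self.state, reward, done, {}


class EStopWrapper:
    """Terminate the episode as soon as the state leaves the set of states
    observed in expert demonstrations (the 'support'), cutting off
    exploration of states the expert never visited."""
    def __init__(self, env, expert_states):
        self.env = env
        self.support = set(expert_states)  # states seen in demonstrations

    def reset(self):
        return self.env.reset()

    def step(self, action):
        state, reward, done, info = self.env.step(action)
        if state not in self.support:
            done = True           # emergency stop outside the expert support
            info["estop"] = True
        return state, reward, done, info


# Expert demonstrations only visited states {2, 3, 4}; moving left e-stops.
env = EStopWrapper(ChainEnv(), expert_states=[2, 3, 4])
env.reset()
state, reward, done, info = env.step(-1)  # state 1 is outside the support
```

In this sketch, stepping left from the start state immediately triggers the e-stop, so the base RL algorithm never spends further samples exploring states 0 and 1.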
The paper proposes a method for curtailing unnecessary exploration in RL with a provable bound on the resulting regret. The stopping method, called e-stop, learns from state-only demonstrations provided by an expert. The paper is very well written and easy to follow. The theoretical analysis of the method is compelling. The experiments are rather minimalistic, but they support the theoretical analysis.
Mo' States Mo' Problems: Emergency Stop Mechanisms from Observation
In many environments, only a relatively small subset of the complete state space is necessary in order to accomplish a given task. We develop a simple technique using emergency stops (e-stops) to exploit this phenomenon. Using e-stops significantly improves sample complexity by reducing the amount of required exploration, while retaining a performance bound that efficiently trades off the rate of convergence with a small asymptotic sub-optimality gap. We analyze the regret behavior of e-stops and present empirical results in discrete and continuous settings demonstrating that our reset mechanism can provide order-of-magnitude speedups on top of existing reinforcement learning methods.
Samuel Ainsworth, Matt Barnes, Siddhartha Srinivasa
Papers published at the Neural Information Processing Systems Conference.